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SUMMARY 

dl8cu88ion  of  a  few  typical  theories  of  decision  making 
In  small  human  groups,  and  description  of  a  few  pilot  experi¬ 
ments  Illustrating  the  kind  of  theoretical  end  experimental 
problems  that  are  met  in  testing  these  theories.,  ! 

\ 
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1 .  Iif)tro<5uctlon 

This  paper  propoaes  a  class  of  rnathematlcel  models  to 
represent  the  behavior  of  a  ^^roup  of  one  or  more  Individuals 
engaged  In  a  decision-making  process.  In  one  sense,  this  class 
of  models  can  be  thought  of  as  a  general  theory  of  organization 
whose  parameters  permit  description  of  a  wide  variety  of  be¬ 
havioral  situations.  Tliere  is  also  some  discussion  of  the 
experimental  proDlems  that  are  met  in  the  attempt  to  test  the 
theory  for  applicability  to  everyday  human  situations. 

Ihe  von  Neumann-Morgenstem  [l]  formulation  of  the  choice 
situation  as  an  n-person  game  in  normal  form  with  a  finite  set 
of  choices  available  to  each  person  at  each  moment  of  time,  and 
with  a  valuation  of  the  expected  outcome  for  each  person  and 
each  set  of  choices.  Is  accepted  as  the  underlying  description 
of  the  behavioral  alternatives  available  to  the  group.  The 
Bush— Hosteller  [2]  formulation  ti.*^  learning  situation  for  a 
single  person,  treated  as  a  stocheat’c  process,  is  accepted  as 
the  underlying  descvlption  of  individual  behavior  In  a  repeti¬ 
tive  choice  situation.  The  Balea-Householder  [3]  formulation 
of  the  group  interaction  process,  treated  as  a  stochastic 
process,  is  accepted  as  the  underlying  description  of  gi-O’ip 
behavior  in  a  repetitive  choice  situation.  The  class  of  models 
presented  in  this  paper  is  essentially  a  synthesis  of  the 
essentials  from  these  three  broad  theories  so  as  to  ylela  an 
organization  theory  general  enough  to  relate  to  the  dynamic 
valuation,  learning,  interaction,  and  decision  processes 
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observed  in  group  decislon-naklng. 

A  few  pilot  experiments  have  been  conducted  to  test  various 
special  features  of  the  general  model.  One  central  problem 
concerns  the  methods  used  by  a  coalition  of  persons  to  establish 
the  division  of  rewards,  derived  by  forming  the  coalition,  among 
its  several  members;  seme  experimental  results  ^n  connection  with 
this  problem  have  been  reported  elsewhere  [4j .  In  their  most 
common  form,  the  stociiastlc  learning  models  describe  behavior  in 
terms  of  a  dichotomy  of  payoffs,  termed  sln^jly  "reward"  and 
"punishment' ;  the  conversion  of  the  game-theoretic  model,  with 
Its  many-valued  payoff  functions,  to  this  dichotomous  form  has 
been  discussed  elsewhere  [3]  together  with  some  experimental 
results  pertinent  to  this  aspect  of  the  problem.  Several  ex¬ 
periments  designed  to  provide  parameter  estimates,  and  tests  of 
validity,  have  shown  the  stochastic  learning  models  to  be  pro¬ 
mising  for  description  of  Individual  human  learning;  a  sample  of 
these  experimental  results  and  references  to  other  authors  may 
be  found  In  an  earlier  paper  [6] .  One  especially  difficult  task 
is  the  prescription  of  the  rul.'*  determining  group  decision  In 
terms  of  individual  choices,  Involving  such  questions  as  major¬ 
ity  rule  and  the  like,  and  many  recent  authors  have  discussed 
various  aspects  of  this  problem;  a  few  experimental  results 
along  these  lines  have  been  reported  in  sn  earlier  series  of 
papers  [?] . 

The  present  paper  disposes  of  the  troublesome  game-theoretlo 
problem  of  coalition  and  division  by  letting  these  results  be 
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d«termlned  atocht^stlcsily  oy  other  properties  of  the  group 
decision  model,  'fhe  payoff  fanctlons  are  generally  not  known 
to  the  Individuals  at  the  outset  of  their  interaction,  and  are 
only  partially  learned  during  a  sequence  of  group  decisions,  so 
that  the  model  reflects  this  human  feature  of  learning  values 
from  outcomes  of  similar  acts  previously  experienced.  Unfor¬ 
tunately,  It  has  seemed  necessary  to  make  an  arbitrary  selection 
of  the  rule  for  determining  group  decision  in  terms  of  individual 
choices;  this  Imposition  of  a  rule  of  conduct  on  the  group  by  a 
superior  authority  may  or  may  not  be  realistic,  and  is  a  feature 
of  the  model  worth  further  attention. 

A  first  very  crude  pilot  experiment  waa  conducted  to  see  if 
the  kind  of  experiment  proposed  In  this  paper  seems  interesting 
on  purely  subjective  grounds.  As  one  of  the  two  subjects,  I 
found  the  situation  to  be  surprisingly  realistic  in  the  sense 
that  I  had  real  dlffi%-ilty  in  making  the  choices  and  felt  that 
the  two  of  us  should  be  able  to  communicate  and  cooperate  so  as 
to  increase  our  returns.  More  extensive  experiments  arc  in 
progress  but  it  seems  likely  that  they  will  only  serve  for  some 
time  to  come  as  a  means  toward  Improvement  of  the  .model  and  the 
methods  of  testing. 

Perhaps  this  paper  is  a  bit  premature.  It  presents  no 
striking  results,  either  of  an  experimental  or  theoretical 
nature.  But  the  key  effort  is  synthesis  of  various  mathematical 
theories  pertaining  to  components  into  a  first  crude  general 
theory  of  organization.  It  is  hoped  that  others  working  in 
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related  fields  may  find  enough  cf  Interest  hei'e  to  be  of  help 
In  furthering  such  a  ayntheals. 

2.  The  e>roup  decision  model 

I  shall  define  the  decision  models  only  for  groups  with 
two  members,  leaving  the  rather  straightforward  extensions  to 
larger  groups  for  another  paper.  The  two  members  are  assumed 
to  be  interested  In  the  same  set  of  alternatives,  and  their 
choices  at  each  moment  of  time  are  determined  both  by  their 
Individual  preferences  and  by  the  Interaction  between  members 
in  8  manner  now  to  be  defined  precisely. 

Individual  1  has  an  m-component  stochastic  preference 
vector  p^(t)  at  moment  t  such  that  he  will  select  alterna¬ 
tive  X  with  ...robabllity  P^(t)  If  he  acts  alone.  This 
vector  satisfies  the  stochastic  relation; 


2.1)  p^(t+l)  >  L^(l^,  a^J  p^(t). 


where  L^(l^,  3^)  Is  an  operator  that  carries  a  stochastic 
vector  Into  a  stochastic  vector,  1^  is  the  actual  choice  made 
at  moment  t,  and  s^  is  a  value  attached  to  the  consequences 
following  choice  1^.  In  this  paper,  the  model  will  be  taken 
in  a  special  form,  as  follows: 

1 }  3^  is  either  1  or  0  according  fcs  the 


c'  • 


choice  Is  rewarded  or  non-rewarded, 

2)  L  (1^,  s^j  -  s^R  P  ,  where 


11, 


11, 


n 


end  P 


are  stochastic  matrices. 
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Member  1  has  a  set  of  3-coraponent  stochastic  attitude 

i  1x.  / 

vectors  (t;  at  mo-nent  t,  for  alternative  x,  and  for 

source  J,  such  that  he  will  decide  on  action  r  with  probab— 
i  ix 

lllty  d^*^  (t).  These  vectoro  satisfy  the  stochastic  relations: 

2.3)  d^^*(t+l)  =  , 

where  ,x.  )  ie  an  operator  that  cBr"l'‘3  a  atoch— 

astlc  vector  into  a  stochastic  vector .  Whenever  member  1 

must  make  a  proposal  to  himself  he  does  als  with  probab.;  11  ty 

i  lx 

I  for  alternative  y.  In  a  sentence,  d"*^  (t)  Is  the  pro— 

w  “ 

bablllty  that  1  will  decide  on  r  after  J  proposes  alter¬ 
native  X  to  1  at  moment  t.  Similarly,  J^,r^,x^ ; 

i  1x 

is  applied  to  d  *'  (tj  after  1^  actually  decides  on  action 
r.  sfter  J,.  proposes  eltematlve  x,.  to  1.  at  moment  t. 

V  V  V  W 

In  this  paper .  the  model  will  be  taken  in  a  special  form,  as 
follows : 


1  j 

are 

stochastic  matrices, 

2j 

>  1 

If 

*  *’ 

5; 

-  1 

If 

'’t  * 

-  1 

If 

It  -  Jt  ^  ^  when  r^  4  0 

These  four  special  restrictions  may  be  Interpreted  as  follows: 
2.4.1)  This  is  8  specialization  of  mathematical  rcrra,  sc 
that  the  oper.  tors  are  matrices  with  non— negative 
elements  and  with  columns  that  sum  to  unity. 


P-^12 


2.4.2) 


2.4.4) 


The  ranges 


No  attitudes  toward  one  alternative  are  changed  by 
reaction  to  the  proposal  of  some  other  alternative. 
No  attitudes  are  changed  if  the  reaction  la  simply 
to  question  a  proposal. 

The  attitudes  of  one  member  ere  not  changed  as  a 
consequence  of  the  reaction  of  another  member  to  his 
own  proposal. 

of  the  variables  are  limited  as  follows: 


x,x, 


0.1.2. 

1.2. . . . .m. 
0.1.^, ...,N, 


The  three  possible  actions  may  be  thought  of  as  question, 
accept,  or  reject  for  r  =  C,i,2.  respectively.  In  further 
specialization,  it  Is  assumed  that: 


1 ) 


4 

’  '^t 


''‘t> 


-  \ 
j 


( 


0  C  \ 


/ 


and 


It  will  usually  be  convenient  to  omit  x 


in  writing 
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1  ^ 

the  D-matrIces  and  to  write  simply  D  i even  mcrj 

simply  D^'^(tj. 

We  must  now  descri'^e  the  order  of  jnteractlcn  in  order  to 
specify  the  sequence  of  operations  with  the  D-raatrices.  The 

interaction  is  initiated  by  member  with  probability  q,  , 

^  i  "c 

by  selecting  alternative  x  with  probability  .  He  then 

1  i  X  ^ 

decides  cwi  action  r^  with  probability  d^^  ^  ^(0).  Both 

members  now  apply  operators  ^o’ ^o' “o ' 

their  d^'^*(0)  to  obtain  their  d-‘^^(l).  If  r^  |  1  this 

c 

Initial  sequence  is  repeated  until  at  some  raoraent 
happens  for  the  first  time  that  r.  »  1;  then  loth  players 

apply  operators  )  to  their  d^’^*(t^)  to  obtain  their 

d^’^*(ti  +  1).  Now  the  other  player  i,.  ^  t  t  decides 

on  action  r*.  .  t  with  probability 

^1+  X 

^t,+  1  H,*t, 

^t^+  1  ^ 


,1  Jx, 


and  Doth  members  then  apply  operators  “  (l^+l,  to  their 
d^'^*(t,+  l)  to  obtain  their  d"" t,  +  2  ;  .  If  i\  ■,  »  2,  then 

jL  X  N,  ^  “T  1 

,  selects  alternative  x.  ^  with  proLailllty 
t^+  1  ^  ^ 


t^>  1 


and  the  sequence  c  operaticns  continues  fcr  ^  ^  in 
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1^  at  the  start-  with 
o  ' 


manner  exactly  analOj^ons  to  that  for 
the  roles  of  the  two  members  Interchanged.  On  the  other  hand. 


if  +  1  4  2,  then  the  action  reverts  to  member  +2" 
who  repeats  the  entire  sequence  of  steps  starting  with  an  al— 

selected  with  probability 


tematlve  x 


t,+  2 


^t^+  2 

This  pattern  of  actions  is  shown  concisely  In  the  flow  diagram 
following. 


Plow  Dlagi»am 


©- 


ISelect  — >  Decide 


Question 


-  D  — 


>  I  Accept  -  D  — >  [Interchange 


^  [Reject!  —  D  — 


Receive 


— >  Decide 


j^lQuestlorj  -  D  — | Interchange) — (a) 


>  jAcoeptj  -  D  — llnterohan'^i  — © 

© 


Reject!  -  D  — 


_  D  _ =  Apply  appropriate  D— matrices  to  d— vectors  for  both 

memt^ers,  and  pass  from  t  to  t  +  1. 

T/iere  ai’e  many  ways  In  which  we  could  combine  the  leam^-ng 
and  Interaction  models  to  get  a  group  decision  model.  We  shall 
identify  I'  with  p^(T),  and  think  of  the  Intervals  between 


@©@ 
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consecutive  values  of  T  as  interaction  times  involving  moments 
t  -  0  to  t  «  N(T);  hence,  the  sequence  of  events  is  an  Inter¬ 
action  stage  f'^ora  T*t«0  to  t»  N(0)  resulting  In  a  Joint 
decision  ly  both  members  to  choose  an  alternative  x(l)j  as  a 
conseqxAence  of  this  choice,  the  preference  vectors  p^(0)  are 
modified  according  to  2.1)  and  the  next  Interaction  stage  occurs 
starting  with  1^(1)  «  p^(l).  The  rule  for  a  J^lnt  decision  is 
here  taken  simply  to  be  acceptance  by  one  member  of  a  proposal 
by  the  other,  although  stronger  requirements  such  as  two  (or 
more)  consecutive  acceptances  of  the  same  alternative  could  be 
Imposed  It  is  not  assumed  that  one  member  is  always  rewarded 
if  and  only  if  the  other  ^mber  Is  rewarded;  the  two  schedules 
of  reward  and  non-reward  may  be  completely  Independent,  or  they 
may  be  correlated.  After  choice  It  Is  assumed  that  the 

Interaction  process  starts  over  again  with  the  same  Initial 
values  d  (0),  and  that  the  probability  Is  always  q^  that 
«.ember  1  will  be  the  first  to  make  a  proposal;  of  course 
qi  +  q^  -  1 .  All  these  quite  arbitrary  assumptions  have  been 
made  in  older  to  end  up  with  a  single  group  decision  model  vdiose 
properties  can  be  studied  mathematically  and  whose  applicability 
can  be  tested  experimentally. 

5 .  A  group  experiment 

I  shall  propose  a  type  of  group  experiment  that  could  pro¬ 
vide  the  data  necessary  for  estimating  values  of  the  parameters 
entering  into  the  two-ciember  group  decision  model  defined  In 
^  2,  hereafter  called  Model  G.  The  extension  to  more  than  two 
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mcml-ers  Is  involved  but  straight  forward,  end  will  be  postponed 
until  later. 

Two  subjects  play  a  game  together  with  the  following  rules 
for  moves  and  payoffs: 

Move  1 ■  A  chance  move  determining  that  Subject  i^  has 
Move  ?  with  probability  q.  ,  where  q.  +  q.^  »  1. 

il  X  d 

Move  2.  Subject  i^  sends  to  the  other  subject  s  positive 
integer  <  m. 

Move  3.  Subject  4  sends  one  of  the  following  three 
messages  to  Subject  i^^: 

0.  Pass, 

1.  Accept 

2.  Reject  Xj^  and  propose  x^  +  x^,  where 
0  <  X2  <  m.  He  also  pays  the  referee  sn  amount 
2V  if  he  rejects,  and  if  he  passes. 

Move  4. 

*•  If  Move  ^.1,  the  referee  pays  one  unit  to 

Subject  i  with  probability  ,  or  receives 
one  unit  from  Subject  1  otherwise,  and  this 
terminates  a  play  of  the  game, 

If  Move  3.2,  then  Subject  1^  sends  one  of 
the  following  three  messages  to  Subject  1...; 

C 

0.  Pass, 

1.  Accept  x^f 

2.  Reject  x^  »nd  propose  x^  4  x^. 
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Move 


He  alBo  pays  the  referee  2V  If  he  rejects^ 
rnd  V  If  he  passes. 

If  Move  3»0t  then  Subject  1^  sends  an 
to  Subject  as  for  Move  2. 


a.  If  Move  4b. 1^  the  referee  pays  one  unit  to 

Subject  1  with  probability  ,  or  receives 

*2 

one  unit  from  Subject  i  otherwise,  and  this 
terminates  a  play  of  the  game, 

b.  Tf  Move  4b. 2,  then  Subject  has  the  op¬ 

tions  of  Subject  i^  listed  in  Move  4l  ?. 

c.  If  Move  4b. 0,  then  Subject  i^  has  the  op¬ 


tions  of  Subject  i^  listed  in  Move  2. 


Moves  6—£. 


A  play  of  the  game  is  either  terminated  by  a 
sequence  of  moves  like  Moves  1-5,  by  one  player 
accepting  a  proposal  of  the  other  within  i  ^  moves, 
or  a  play  is  terminated  at  the  move. 

The  quantities  are  actually  chosen  so  as  to  represent 

the  payoffs  In  a  2-per8on  game,  »4iere  x  provides  an  enumeration 

of  tiie  strategy-pairs  available  to  the  two  players.  For  example. 

If  the  payoff  matrices  for  the  game  in  normal  form  are  p!,  , 

for  (a  «  1,2,...,  a;  p  ■  1,2,...,  b),  then  x  Is  some 

enumeration  of  the  at  pairs  (a,  5).  The  payoff  units  are 

chosen  so  that  Ip^aI  <  ^  ^-h  the  x-enumeretlon,  »  ilZl  . 

c*p  —  X  2 

For  some  purposes,  It  may  be  easier  to  use  the  a5  aenumerstlon 
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and  we  ahail  denote  it 

ap 

A  set  of  specific  values  will  now  be  assigned  for  the  con¬ 
stant  cyusntltiea  in  the  experiment,  as  an  illustrative  example, 
8S  follows: 

a.  -  1/4,  Qg  -  5/4, 

c.  1  unit  a  1  cent 

d.  V  -  O'.l, 


1.1 

1.2 

2,2 

2.1 

X 

1 

2 

^  ^ 

5 

4 

6vJ 

2 

6 

4 

00 

8 

6 

2 

5 

The  best  that  the  two  players  can  do  jointly,  as  partners,  is 
always  to  choose  alternative  2  with  an  expected  net  gain  per 
play  for  Subject  1  of  i/4  cent  and  for  Subject  2  of  1/2  cent; 
what  they  will  or  should  actually  do,  with  sldepayacnta  prohibit¬ 
ed,  is  an  open  question  within  the  game— theoretic  framework. [4] . 

The  experiment  should  be  arranged  so  that  neither  subject 
ever  knows  the  identity  of  the  other,  nor  the  actual  payoffs 
made  to  the  other.  Tliey  communicate  with  each  other  only  through 
the  strictly  limited  language  represented  by  the  words:  accept, 
reject,  pass,  propose  x,  etc.  They  are  both  told  that  what  they 
do  affects  the  outcome  for  both,  but  that  a  choice  that  is  pro¬ 
fitable  for  one  subject  may  also  be  profitable  for  the  other. 
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They  are  urged  to  behave  so  as  to  maximize  their  total  indivi¬ 
dual  net  receipts  in  a  series  of  100  plays  and  are  told  that 
they  should  both  come  out  ahead  of  the  game  if  they  get  together 
properly,  although  one  subject  may  profit  more  than  the  other. 

4.  Parameter  estimation 

Hie  group  decision  Model  G  may  be  thought  of  as  a  function 
CKt,0)  that  will  yield  a  sequence  x  (t)  of  choices  for 
t  -  1,2,***,  N  among  m  +  1  alternatives  where  is 

the  case  in  which  no  acceptance  is  obtained  within  Z  ^  moves. 
The  quantity  0  represents  a  set  of  parameters  Bp 

and  x^(t)  has  a  probability  distribution  F(x^j  0)  that 
determines  the  relative  frequency  with  which  each  possible 
sequence  x**  may  be  expected  in  repeated  applications  of  0 
producing  aequenoes  of  N  choices  each. 

Hie  group  experiment  propoaed  in  ^3  also  yields  a  set  of 
sequences  x”,  say  with  observed  frequencies  f(x  ),  so  the 
estimation  problem  is  that  of  determining  the  particular  set  0 
of  parameter  values  that  makes  for  the  best  agreement  between 
p(x**,  0)  and  f(x^).  Standard  statistical  methods,  such  as 
maximum  likelihood  or  moments,  may  oe  used  to  obtain  the  re— 
qulred  estimates  0. 

Hie  parameters  0  that  are  involved  are  as  follows: 
p^(0)  for  J  -  1,2;  1  -  1,2, 3, 4. 


I 


dj^^{0)  for  i,J  -  1,2;  k  -  0,1,2;  X  -  1,2, 3, 4. 


for 


01  a  1>2. 
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a 


for  1,J  =  1,2;  a,p  m  0,1,2. 


niest  are  not  independent,  and  there  ar-e  liialta  on  the  ransom 
allowed  for  acme  of  the  parameters;  for  example,  p^<0)  axKl 
d^^*(0)  are  atochaatic  vectors  and  and  are  atoohaatl^ 
matricea.  I  shall  now  reduce  the  number  of  parameters  involved, 
by  choosing  values  for  some  of  them  rather  arbitrarily;  it  will 
still  be  very  difficult  to  carry  out  the  estimation  eoaputationa 
even  with  this  reduced  set.  Specifie^ly,  it  is  assumed  that: 


1) 

1/4  for  J  .  1,2; 

i  .  1,2.J.». 

rd^->*(o) 

-  dg^^CO)  -  0  and 

d^’-*(0)  -  1  for  1,J  -  1,2; 

2)  < 

1 

X  -  1.2,3.‘». 

-  d^  for  i  +  J  - 

1,2;  X  -  1,2,3,». 

3) 

•a  *  •' 

b  «  b  for  a  - 
a 

1,2. 

4) 

.  o’ 

ap 

for  i  •  1,2;  a,p,x  -  1,2, 3#^. 


This  leaves  us  with  six  parameters,  all  limited  to  the  olosed 
Interval  (0,1);  a,  b,  c^,  c^,  d^,  and  d^. 


P-512 

11-1-52 

-15- 


The  asBuisptlcms  of  4.1)  may  be  justified  aomeidiat,  as 
follows : 

4.1.1)  At  the  outset  neither  subject  would  seem  to 
have  any  reason  for  choosing  one  alternative  instead  of 
another,  and  so  they  are  assusied  equally  likely  at  the 
start.  Indeed,  by  prior  conditioning,  the  subjects  can 
surely  be  brought  to  this  initial  state. 

4.1.2)  There  seems  to  be  no  reason  for  ti.a  subject 
idio  moves  first  to  pass  any  alternative,  nor  to  reject  any 
alternative  proposed  to  himself.  We  are  left  with  a  para¬ 
meter  d^  that  measures  the  initial  "agreeability "  of 
player  i,  and  this  msy  well  differ  from  subject  to  sub¬ 
ject. 

4.1.3)  The  parameters  a^  and  determine  the 

rates  at  which  the  attitude  vectors  are  modified  after 
each  move,  and  making  them  independent  of  a  amounts  tc 
assuming  that  an  acceptance  makes  the  next  acceptance  juct 
as  much  more  likaly  as  a  rejection  makes  the  next  rejec¬ 
tion  more  likely.  Of  course,  we  might  have  done  better  to 
allow  the  parameters  in  and  to  depend  upon  1 

so  as  to  provide  for  variation  between  subjects. 

4.1.4)  This  is  the  "mixed"  learning  model,  and  it  has 
been  chosen  because  it  has  seesMd  to  fit  the  data  from 
various  choice  experiments  better  than  other  single-para¬ 
meter  models  when  the  subject  is  convinced  that  the  system 
of  rewards  may  be  varied  from  time  to  time  sc  thet  hit  psst 
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experience  is  not  a  too  reliable  indicator  of  the  future. 
The  parameter  measures  the  rate  at  which  subject  i 

changes  hia  preferences  aa  a  consequence  of  reward  or  non- 
reward;  the  use  of  the  saoM  c^  in  both  and 

means  that  the  rate  of  change  with  reward  is  taken  to  be 
the  sane  as  the  rate  of  ciiange  with  non-reward,  the  so- 
called  "equal  alpha"  case  in  the  Bush-Mosteller  theory. 

We  must  have  a  measure  of  reward  in  order  that  r^ 
be  defined.  The  net  amount  actually  received  by  member  1 
for  play  T  is  either  >1  or  —1,  less  the  total  payments 
P^(T,  he  makes  for  his  rejections  and  paaaes,  so  the 
limita  on  his  net  receipts  R^(T)  are  such  that: 

1  >  R^(T)  >  -  (1  +  lOV). 

The  sin^lest  way  is  to  Ignore  the  costs  P^(T},  as  I 
shall  do  in  this  paper,  and  suppose  that  their  effect  is 
merely  to  alter  the  values  observed  for  a°  and  b®.  One 
alternative  would  be  to  take  account  of  the  costs  P^(T) 
by  a  separate  application  of  the  operator  P^*  with  pro¬ 
bability  if  X  was  choeen  on  play  T.  In  the  ex— 

lOV 

periment  described  in  §5  the  charges  P^(T)  were  In¬ 
cluded  to  sisulate  the  "hurt"  experienced  by  a  member  of  a 
group  when  he  rejects  or  quest lens  a  proposal  made  by 
another,  and  it  is  consistent  with  this  simulation  to  let 
any  effects  of  the  P^(T)  be  reflected  in  the  a®  and 
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as  I  have  done. 

3.  Experimental  design 

There  are  many  different  experiments,  of  course,  that  would 
yield  data  adequate  for  parameter  estimation  and  test  cf  (good¬ 
ness  of  fit  for  each  ^^oup  decision  model.  As  experience  Is 
trained  with  the  mathematical  characteristics  of  the  model,  and 
with  the  experimental  difficulties  Inevitably  met,  the  experi¬ 
menter  will  be  better  and  better  able  to  choose  amon^  the  many 
available  denigaB.  1  can  here  only  discuss  certain  general 
considerations  Ghat  will  help  to  determine  the  selection  of 
early  designs. 

Even  s  model  as  simple  as  Q,  with  Its  very  severe  restric¬ 
tions  on  the  parameters.  Involves  estimation  of  six  parameters 
In  each  type  of  experimental  situation.  It  will  be  desirable. 

If  Indeed  It  la  not  necessary,  to  select  a  sequence  of  experi¬ 
ments  that  will  provide  estimates  rf  Just  one,  or  at  most  a  very 
few,  of  the  parameters  In  each  experiment.  For  exan5)le,  each 
Individual  might  be  first  Investigated  in  a  simple  learning  ex— 
perlsient  so  as  to  estimate  the  parameters  entering  Into  his 
learning  operator.  Next  the  parameters  entering  Into  Ills  atti¬ 
tude  operators  might  be  estimated  one  or  two  at  a  time  from  ex¬ 
periments  Involving  only  group  interaction  and  no  learning. 
Finally,  the  validity  of  these  stepwise  estimates  for  indivi¬ 
duals  singly  and  in  pairs  mig^t  be  checked  experimentally  by 
testing  them  In  different  cofublnatlGns  or  in  different  expert- 
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mental  situations.  The  hope  would  be  that  eventually  the  ex¬ 
perimenter  would  find  a  parametric  structure  and  a  way  to  esti¬ 
mate  that  would  yield  values  that  ware  essentially  invariant 
from  one  experimental  situation  to  another  quite  different  ooa. 

Even  when  a  ain^^le  parameter  haa  been  Isolated  for  estima¬ 
tion  there  are  many  experiments  that  mi(^t  serve  for  the  purpose. 
For  example «  in  som  of  tae  learning  experiiaanta  already  done  to 
provide  estimates  of  parameters  in  the  learning  operator,  it  was 
found  simpler  to  test  whether  or  not  one  entire  range  of  values 
was  admissible  than  to  estimate  the  actual  value;  because  the 
asymptotic  behavior  was  qualitatively  different  inside  the  range 
and  outside  of  it.  In  fact,  qualitative  tests  of  this  sort, 
that  depend  on  mathematical  properties  derived  for  particular 
parameter  sets,  will  pro>^ably  provide  a  quicker  and  surer  way 
toward  acceptance  and  rejection  of  classes  of  models  than  will 
detailed  parameter  estimation  until  we  have  gai’^jed  a  far  better 
general  understanding  of  the  properties  and  applicability  of  the 
general  type  of  models  under  considaraticm. 

Specifically,  for  the  learning  operator  of  Model  0,  I  would 
expect  that  the  size  of  c^  would  depend  upon  the  strength  of 
reward  and  nor>— reward;  perhaps  the  larger  the  reward  the  larger 
the  value  of  c^,  as  ia  suggested  by  the  results  of  some  of  the 
experiments  reported  by  Hosteller  [o] .  In  our  model,  we  have 

a 

aude  no  direct  provision  for  a  funotional  dependence  of  on 

strength  of  reward,  although  we  could  eatlly  --  sc  Ir  a  variety 
of  waya.  At  present,  since  theory  does  not  • aggest  s  funotional 
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form,  It  will  probably  be  necessary  to  explore  the  dependence  of 
i 

c  on  strength  of  reward  by  conducting  experiments  designed  to 
yield  data  relevant  to  this  question.  With  success  In  this.  It 
ml^t  then  be  possible  to  replace  Model  Q  by  a  new  c»ne  In  which 
strength  of  reward  s  Is  taken  Into  account  directly  In  such  a 
way  that  c  (s)  will  be  Invariant  from  situation  to  situation. 

It  will  be  especially  Important  to  test  tentative  parameter 
estimates  for  predictive  validity  In  rather  different  situations. 
For  example.  If  the  learning  and  attitude  operators  are  estimated 
for  each  of  several  individuals  In  pairwise  group  experiments, 
the  Ideal  catfe  Is  that  in  idiich  the  parameter  estimates  for  a 
particular  individual  are  essentially  the  same  when  derived  fron 
several  pairwise  experiments  In  which  he  is  a  participant.  One 
way  to  study  this  kind  of  predictive  validity  Is  to  conduct  ex- 
perlraents  In  which  one  member  of  the  pair  Is  a  human  subject  and 
the  other  is  a  known  mathematical  model;  in  this  way,  the  para¬ 
meters  for  a  subject  can  be  estimated  over  a  sequence  of  situa¬ 
tions  under  the  control  of  the  experimenter  and  for  extreme  or 
critical  values  of  parameters.  One  especially  uservi  device, 
that  can  sometlmea  be  used,  is  tc  have  the  subject  assume  both 
roles  in  the  group  without  his  kr<owledge  so  as  to  ensure  symmetry 
in  the  model  end  inter-compereblll ty  of  utilities  and  rewards. 


Vn 
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